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Abstract — We resolve the question of optimality for a well- 
studied packetized implementation of random linear network 
coding, called PNC. In PNC, in contrast to the classical mem- 
oryless setting, nodes store received information in memory to 
later produce coded packets that reflect this information. PNC 
is known to achieve order optimal stopping times for the many- 
to-all multicast problem in many settings. 

We give a reduction that captures exactly how PNC and other 
network coding protocols use the memory of the nodes. More 
precisely, we show that any such protocol implementation induces 
a transformation which maps an execution of the protocol to 
an instance of the classical memoryless setting. This allows us 
to prove that, for any (non-adaptive dynamic) network, PNC 
converges with high probability in optimal time. In other words, 
it stops at exactly the first time in which in hindsight it was 
possible to route information from the sources to each receiver 
individually. 

Our technique also applies to variants of PNC in which each 
node uses only a finite buffer. We show that, even in this setting, 
PNC stops exactly within the time in which in hindsight it was 
possible to route packets given the memory constraint, i.e., that 
the memory used at each node never exceeds its buffer size. 
This shows that PNC, even without any feedback or explicit 
memory management, allows to keep minimal buffer sizes while 
maintaining its capacity achieving performance. 

I. Introduction 

It is by now a classical result [1] that linear network 
coding archives capacity for multicast and that even choosing 
a random linear code suffices with high probability [2], [3]. 
The rateless and self-adaptive nature of random linear network 
coding has been shown particularly beneficial in distributed 
settings with time-varying network topologies. For these set- 
tings, a distributed and packetized network coding (PNC) 
implementation has been proposed [4], [5] in which nodes 
keep received packets in memory and forward random linear 
combinations of these packets whenever they send a packet. 
The performance of PNC has since been studied in various 
network and communication models such as: static networks 
with losses [5]-[8], gossip networks [9]-[12] or adversarial 
dynamic networks [13], [14]. These works feature new and in- 
teresting techniques such as connections to queuing networks 
and Jackson's Theorem [5], [12], or other stochastic modeling 
[9], [14] and prove (asymptotic) order optimal stopping times 
that are tight in worst-case examples. 

We show that the optimality of PNC can be understood 
via a reduction to the classical memoryless setting [2], [3]. 
We prove that in any network model, whether it is static, 
stochastic or non-adaptively adversarial, PNC converges with 
high probability in optimal time: With high probability PNC 



delivers all information to all receivers at exactly the first 
time-step in which in hindsight it was possible to route this 
information from the sources to each receiver individually. 

Our reduction shows that, for any network coding protocol, 
it is possible to describe a transformation that captures exactly 
how the memory of the nodes is used. The transformation, 
which is induced by a concrete protocol implementation, maps 
an execution of the protocol to an instance of the classical 
memoryless setting. This technique also applies to variants 
of PNC [15], [16] in which each node only keeps a finite 
amount of packets in active memory. We show that, even in 
this setting, PNC stops exactly within the time in which in 
hindsight it was possible to route packets given the memory 
constraint, i.e., that the buffer at each node never exceeds its 
active memory size. This shows that PNC, even without any 
feedback or explicit memory management [17], optimally uses 
the limited buffers. 

This paper is organized as follows. We provide a short 
review of the memoryless network coding results and PNC in 
Section II. In Section III, we introduce our network model. In 
Section IV, we present our method to transform a protocol 
execution into a circuit that captures exactly how a given 
protocol implementation uses memory. Using this, we prove 
that PNC and several of its variants are optimal in Section 
IV-C. Finally, we summarize our contributions in Section V. 

II. Network Coding Review 

A. Memoryless Network Coding Setting 

In the memoryless network coding setting [ l]-[3], a directed 
acyclic circuit processes messages from a finite field Fq (or 
alternatively F^^). A circuit is a directed acyclic hypergraph 
C — {V,A). For each node v G V, we denote r+(u) as the 
incoming hyperedges, and r^(t;) as the outgoing hyperedges. 
For each e G T~(v), v contains a coding vector Ce G 
We assume that there is only one node with exclusively 
outgoing hyperedges, the source node s G V. Assuming an 
assignment of a message val{e) G F^ to each hyperedge 
e G r^(s), the circuit C processes information as follows. 
Each hyperedge can inductively be assigned a message in Fq 
by using the rule that the vector associated with an outgoing 
hyperedge e of u is Cg • val(r^{v)). In this way, C defines 
a linear transform T{T^ (s), E') G Fg (^'^^ between the 
messages val{r^ {s)) and the messages assigned to any subset 
of hyperedges E' C E. 



Reference [1] shows that if the field size q is large enough, 
one can choose the c,. such that the rank of T{r~ (s), E') 
is equal to the min-cut between s and E' in C. In such a 
case, any node v with a min-cut of at least |r~(s)| can invert 
T(r~(s), r+(w)) and decode all messages val{r~ (s)). Fur- 
thermore [2], [3] show that, with high probability, this remains 
true even if the coding coefficients are chosen uniformly at 
random. These are the classical results on (random linear) 
network coding that started this line of research. 

Note that, in this model, timing is irrelevant and each node 
processes each message only once. References [I], [^] show 
that this setting can be extended to non-acyclic circuits with 
delays. Nonetheless, nodes remain stateless and memoryless, 
which is why we refer to these networks as circuits. 

B. PNC: Distributed Packetized Network Coding 

In this section, we introduce the PNC protocol [4], [5] 
in which, in contrast to the memoryless setting, nodes store 
received information in memory to later produce coded packets 
that reflect this information. 

Assume that there are k messages from distributed to 
the nodes. If the PNC protocol is used in a network, any node 
u communicates by sending packets that contain vectors from 
pfc+i ^jjj maintains a subset 5„ C F^'+' of received packets. 
Initially, Su is empty for all nodes u. When node u initially 
knows the i*'' message G F^^, it adds the vector (6^,3^) to 
Su, where is the i*'' unit vector in F^'. If node u is requested 
to send a packet it sends a random vector from the span of 
Su- Note that this description is completely independent of 
any assumption on the network. 

If enough communication takes place among nodes for the 
system to "mix", then for each node u the subspace spanned 
by Su will converge to the k dimensional subspace of F^'+' 
given by the k input vectors. Each node can then use Gaussian 
elimination to recover the input messages. 

References [9]-[13] provide upper bounds on how quickly 
this "mixing" happens for specific (stochastic) communication 
models. In this work, we prove a stronger statement that the 
mixing happens with high probability in optimal time for any 
communication history. 

III. Network Model: Time Expanded Hypergraphs 

We consider discrete or continuous time dynamic net- 
work topologies where communication links are established 
synchronously and/or asynchronously. Nodes can potentially 
send data at different and highly non-regular rates. Links are 
assumed to have varying delays. We also incorporate broadcast 
constraints that arise in wireless settings. Our model applies 
to any static or stochastic model, including arbitrary stochastic 
link failures, and to adversarial worst-case communication 
schedules chosen by an oblivious adversary. All these models 
specify a (distribution over) communication schedules that is 
independent from the randomness in the coding coefficients. 
We shall prove a point-wise optimality, i.e., for any instance 
of a communication schedule, PNC achieves optimal perfor- 
mance. Therefore, throughout the rest of the paper, we assume 



that there is a specific given communication schedule on which 
we have to give an optimality proof. 

Each communication schedule can be specified as a se- 
quence of events, where a node sends or receives packets. 
We assume that, at each time, a node either transmits or 
receives a packet. We capture these events using the following 
definition of a time expanded communication hypergraph. This 
notion of time expanded hypergraph has been previously used 
under different names, e.g. continuous trellis [o] or adversarial 
schedule' [14]. 

Definition III.l (Time Expanded Hypergraph). Consider a 
network with n nodes, and denote this set of nodes as V. 
A communication schedule from time to t among nodes in 
V is captured by the following time expanded hypergraph 
G = {y,V',A). Let V ^ V be a node in the network. We 
create a vertex copy Vf G V for every time t' g [0, t] 
when the node v receives or sends at least one packet. If v is 
transmitting at time t' to nodes , ...,u^ with associated 

delay Ai, A2, Af, respectively, we create a single hyper- 
edge {vf , {u\,^^^,ul_^_^^, ...,u\,^^^}) e A. 

Given a network, we consider the following (distributed) 
many-to-many multicast problem. Messages are generated 
at nodes in the network. A message can be generated at 
(multiple) different times at multiple nodes. The goal is to 
disseminate all the messages to all nodes (or a subset of 
destination nodes D C y) as fast as possible. One example of 
an application of this problem could be a source distributing a 
large file (which is divided into small parts) to many receivers. 
Another application is in sensor networks, where each sensor 
transmits its measurements at different times. 

To formalize this problem, we assume that there are exactly 
k messages that are vectors of F^. We assume that the nodes 
employ the PNC protocol of Section II-B. Note that this 
requires each message to have a unique identifier that is 
known to every node at which the messages is generated. 
We incorporate the message generation in our network model 
using the following additional definition. 

Definition III.2. Let G = {V,V'tA) be a communication 
schedule of a network in which k messages mi, . . . , G F^ 
are generated. We alter G by adding a supersource node s 
to V'. Furthermore for each message mi that is generated 
by nodes u^^u^,... at time ti,t2,--- we add a hyperedge 
to A. 

IV. Our Results 

Given an adversarial schedule and an initial message distri- 
bution, the network capacity between the source and any node 
at any time can be determined. To do so, one enriches the 
time expanded hypergraph by memory edges, which capture 
the possibility that nodes store knowledge over time. This is 

'indeed, Theorem 3.9. in [ ' -i] states a result similar to our main theorem for 
PNC but with a weaker bound. Instead of proving PNC to be exactly capacity 
achieving with failure probability e = l/poly{n), their result requires at least 
pi- (log k + log e^^) additional capacity. In general, p and I can be of the 
order of k or even larger making this bound quite loose. 



(a) Network transactions over time 



(b) Time expanded liypergraph (c) Information flow hypergraph 



(d) PNC Transform 



Fig. 1: An example network G with V = {a, 6, c, d}. In Figure la, the network communication history is shown in sequence. 
The link delays are shown on the edges. In Figure lb, the time expanded hypergraph of the network in Figure la is shown 
assuming that node h and c start with a message at time ti. In Figure Ic, we show the corresponding natural information flow 
graph Goo- In Figure Id shows the corresponding PNC transform GpNC- 



achieved by connecting each node vt in the time expanded 
hypergraph to its next copy in time Uj/ with an edge with 
capacity equal to the amount of information that v can store, 
i.e., its buffer size /i (in packets). We assume for simplicity 
that all nodes have the same amount of memory /i. If all 
nodes have unlimited buffers, we follow [S] and set yu = oo. 
We call this enriched time expanded hypergraph the (natural) 
information flow graph and denote it by G^. The next lemma 
confirms the intuition that the information flow graph indeed 
represents an upper-bound on the amount of information that 
can be transmitted by any algorithm. 

Lemma IV.l. Let G be the time expanded hypergraph for a 
communication schedule and let Gf^ be its natural information 
flow graph. The the min-cut between the supersource s and a 
node Vt in G^ is an upper bound on the amount of information 
that any algorithm can transmit from the sources to node v 
by time t if all nodes have an active memory of at most fi. 

While Lemma IV.l provides a simple upper bound on 
the achievable point-to-point capacity, the more interesting 
question is whether a given protocol achieves this capacity. 
While one would hope that the optimality of random linear 
network coding carries over from the memoryless setting [2], 
it is not difficult to find protocols that do not achieve this 
capacity, e.g., the shift-register finite memory network coding 
protocol in [16]. In the case of the PNC protocol, several 
results have shown (asymptotic) order optimality in specific 
stochastic settings or upper bounds on the stopping 

time in hypergraph theoretic parameters of the topology that 
are tight up to constant factors in worst-case examples [9]- 
[14]. In the next section, we provide a simpler and a more 
general approach that proves optimality in all the above cases 
(albeit without providing any bound for concrete stopping 
times in specific models). 

A. General Approach 

We show that, for many network coding protocols, it is 
possible to systematically transform the time expanded hyper- 
graph into a circuit that exactly captures how the protocol uses 
memory. Given a protocol, a communication schedule, and 
the corresponding circuit, we prove optimality in three steps. 
We first show that the circuit indeed simulates the execution 
of protocol; then apply the results from [2] for memoryless 



circuits to show that the protocol achieves the min-cut of this 
circuit with high probability; and finally show that the min-cut 
of the circuit is equivalent to the min-cut in G^. 

To describe our transforms, we note that many network 
coding protocol proposed so far [4], [15], [16] are composed 
of two elementary operations: 1) coding packets together by 
taking a random linear combination of them, and 2) storing 
packets in memory. While the coding operation is already 
naturally captured by the memoryless circuits we show that the 
storing operation can be simulated by extending a hyperedge 
(representing a transmission) to all future versions of the 
recipient(s). Using this observation, we define a hypergraph 
transformation {.)x for any given protocol implementation 
X. This transformation takes a time expanded hypergraph G 
and transforms it to the hypergraph Gx that exactly captures 
the execution of protocol X on the communication schedule 
G. Note that the hypergraph transformation {.)x does not 
just depend on the amount of memory X uses but has to 
be carefully designed to match the implementation details of 
protocol X. 

B. Protocols and their Transformations 

In this section, we describe the transforms for several 
protocols. We start with the PNC-protocol from Section II-B 
and then cover two network coding protocols described in 
[15]: the /x-recombinator and the /^-accumulator protocols. 
Both protocols are highly efficient variants of PNC, for which 
any node only stores /i packets in its buffer. Besides reducing 
the required memory resources, this also improves the com- 
putational cost of network coding, because of the reduced 
amount of information each coding operation is performed 
over The two protocols differ in the way the new set of ji 
packets is obtained after a reception of a new packet (and/or 
generation of a new packet). The /x-recombinator simply picks 
^ random packets from the span on the stored packets and 
the received packets while the more efficient /x- accumulator 
randomly combines the incoming packet with each of the ji 
stored packet individually. The next two definitions present the 
transformations for the PNC protocol and the yU-recombinator 
protocol. 

Definition IV.2 (PNC transform). The PNC-transform Gpnc 
of a time expanded hypergraph G = {V,V',A) is formed 
by replacing every hyperedge e £ A by it memory closure 
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Fig. 2: Template for (O^i-accumuiator with fj, ^ 3: The fj, black 
nodes represent the memory and the gray nodes represent 
transmissions. 

e. Here the memory-closure of a hyperedge e = {vt,Re) = 
(wt, ...Uj^}) £ A is defined as e = (vt,Re) where 

i?g = \ut' I 3u, t : Ut £ Re and t' > t}. In other words, we 
extend every hyperedge e to include all future copies of the 
recipients. 

Definition IV.3 (yu-recombinator transform). The ^-recombi- 
nator transform G^.^g^^^^^^j^^jf^^ of a time expanded hyper- 
graph G = (y, V , A) is formed by adding /i edges from every 
vertex Vt £ V to its next copy in time Vt' where t' is the 
smallest t" > t with Vf £ V'. 

Note that the two transforms, GpNC and G^-i-ecombinator, 
have an intuitive structure. Extending a hyperedge in GpNC 
can be interpreted as changing the storage operation of nodesto 
requesting/receiving the exact same packet again whenever the 
"stored" packet is used. For G^-recombinatm-, the fi memory 
edges represent that the /i "stored" packets are used to generate 
the next ^ random packets to be stored. 

Note that, in general, the network transforms are not 
necessarily as natural and straight-forward as suggested by 
Definitions IV.2 and IV.3. One has to be very careful to specify 
and map all implementation details. Indeed, the transform 
presented in Definition IV.3 does not exactly capture the pro- 
tocol described in [' 5] but instead also recombines its stored 
packets whenever a packet is send. For simplicity, we consider 
this variant of the recombinator protocol here. As a final 
example for a slightly more complicated transformation, we 
pictorially describe the /^-accumulator transform. We consider 
the implementation described in [15], [16] in which a random 
multiple of the received packet(s) is added to each stored 
packet. Its network transform G^.accumuiator is formed by first 
taking the GpNC and then replacing each node according to 
the template in Figure 2. 

C. Simulation and Optimality Proofs 

Showing that a protocol implementation and its induced 
hypergraph transformation match is almost always a straight 
forward inductive proof: 

Tlieorem IV.4. Consider a network using the PNC protocol, 
and let G be the corresponding time-expanded hypergraph 
with supersource s. Consider the PNC transform GpNC os 
a circuit as in Section II-A. If the coding vectors for this 
circuit are selected independently and uniformly from Fq 
then this simulates the behavior of the PNC protocol. The 
message associated with each circuit hyperedge e = {vt,Re) 
in GpNC is the message sent by node v at time t. Furthermore, 



the messages on the incoming hyperedges of Vt in GpNC 
correspond to the messages stored in memory of node v at 
time t. 

Proof: For sake of space we present only a proof sketch; 
In order to prove that the circuit GpMC simulates the execution 
of the PNC protocol, we need to specify carefully how the 
randomness used on both sides. For the PNC protocol we 
assume that a node keeps all received packets (and does not, 
e.g., keep only innovative packets) and creates any coded 
packet by drawing random coding coefficient for the packets 
in the order they were received. We similarly fix the process 
of choosing the random coding vectors for the circuit to make 
it match with the PNC protocol. 

Now using an inductive proof over the time (or the topo- 
logical depth of the nodes in G), we can show that Gpnc 
simulates the PNC protocol. Firstly, the messages associated 
with the outgoing hyperedges of the supersource s are by def- 
inition the messages generated by the sources. Now consider 
a node v at time t. We assume, without loss of generality, that 
no node sends a packet when it has not received or generated a 
message. Thus, Vt has at least one incoming hyperedge from 
another node Ut' where t' < t. By construction of Gp^c, 
the incoming hyperedges to vt are from all nodes that have 
sent a packet to v before time t. By induction hypothesis, the 
incoming hyperedges of v correspond to the messages stored 
in V in the PNC protocol at time t. Since both the circuit 
GpNC and the PNC protocol linearly combine packets using 
the same random coefficients, the hypothesis holds for the 
packets created at node v at time t. ■ 

Given Gx as a representation of the execution of X on the 
communication schedule G it is easy to state and proof an 
equivalent of Lemma IV. 1 : The amount of source information 
transmitted from s to v at time t via protocol X is at most 
the (s, ut)-min-cut in Gx- More interestingly, since Gx is 
memoryless, we can directly apply the results of [2] to show 
the converse: 

Lemma IV.5. Let G be the time expanded hypergraph for a 
communication schedule and let Gx be its transform for the 
network coding protocol X. With probability 1 — e, the amount 
of information transmitted from the sources to node v by time 
t is exactly the min-cut between the supersource s and a node 
Vt in Gx- Here e = 0(1/ poly (n)) is an arbitrarily small 
inverse polynomial probability given that the coefficient size 
logq used in X is 0(log7i). 

All that is left to check is that for the protocols presented 
here this min-cut is indeed the same as the information 
theoretical optimum as given by G,, in Lemma IV. 1 : 

Lemma IV.6. Let G be any time expanded hypergraph with 
supersource s. The min-cut between the supersource s and 
any node Vt is the same in Goo cind GpMC- Furthermore, the 

same is true for G ^, G ^^i-ecombinator> and G ^-accumulator- 
Proof: We begin with the equivalence of Goo and GpMC- 
For this we transform any integral flow in Goo to a valid flow 



in GpNC and vice versa. Then, we use the min-cut max-flow 
theorem. The transformation operates on each path in a flow 
decomposition separately and repeatedly removes flow from 
cxD-edges. Consider a flow-carrying unit-capacity hyperedge 
{ut,Wt') with an cxD-capacity memory-edge {wt',wt") imme- 
diately following it {t < t' < t"). We eliminate such oo-edges 
one-by-one by rerouting the flow directly through ut to Wf 
using the extended hyperedges in GpNC- This process is flow 
preserving, respects capacities, and eliminates all cx)-edges 
since every flow path starts with an unit-capacity outgoing 
hyperedge of s. It can be verified that this transformation is 
also reversible; thus, gives a bijection between integral (s, vt)- 
flows in Goo and integral (s, wt)-flows in GpNC- This finishes 
the proof for GpNC- 

For G^^recombinator, onc Can usc the Same strategy, and re- 
route the flow over the /i-capacity memory-edges in G^ to 
the fj, unit-capacity edges in G^.recombinator- 

Similary, for G^-accumuiator, we first re-route the flow over the 
yLt-capacity memory-edges in G^ via the extended hyperedges 
in Gp-accuimiiatoi Created by the PNC transformation. After the 
PNC transformation, G^-accumuiator is formed by replacing each 
node according to the template in Figure 2. In G^-accumuiatm, 
we can re-route the flows of each replaced node since each 
node vt in G^ carries at most /i-units of flow. This is true by 
construction: if a node v is receiving at time t, node Vt has 
one out-going memory-edge with capacity /i; if a node v is 
transmitting at time t, then node vt has only one in-coming 
memory-edge with capacity /i. ■ 

Putting everything together finishes our main theorem: 

Theorem IV.7. Assume a network and communication model 
in which the random coding coefficients are independent 
from the communication schedule. With high probability, the 
PNC, the ii-recombinator, and the accumulator protocols 
disseminate exactly the maximum amount of information from 
the sources to every node that any protocol using fi memory 
could have disseminated. 

V. Conclusion 

In this paper, we resolve the question of optimality for 
the well-studied PNC protocol and similar network coding 
protocols. These protocols use the memory of nodes to pro- 
duce coded packets that reflect everything received so far. We 
show that an implementation of such a protocol induces a 
transformation that maps any execution to an instance of the 
classical memoryless setting. This shows that PNC solves the 
many-to-all multicast problem in any non-adaptive dynamic 
network model in optimal time. Differently phrased, PNC 
makes on-the-fly the optimal decision of what information 
a node should send out without knowing anything about the 
network topology or even which other node will receive this 
information. 

Even more interestingly, this remains true if one restricts 
the nodes to use limited size buffers. We show that both the 
/i-recombinator and the /x-accumulator protocol [15] achieve 
optimal performance in this setting: with high probability 



they stop exactly within the time in which in hindsight it 
was possible to route packets given the buffer constraint, i.e., 
given that the buffer at each node never exceeds the limit. 
Alternatively, one can interpret this result as PNC making on- 
the-fly optimal decisions on which information to keep in the 
limited memory. This shows that, even without any feedback 
[17] or complicated explicit memory management, these PNC 
variants preserve the capacity achieving performance of PNC 
as long as minimal buffer sizes are available. 

This paper also implies that determining stopping times for 
PNC is equivalent to determining the connectivity of a network 
or, more generally, to determining the network capacity. For 
many settings, obtaining good bounds or characterizations for 
the network capacity remains an interesting open question. 
Recently, significant progress was made in this direction for 
both the PNC protocol [13] and its finite memory variants [15]. 
We are hopeful that the insights provided here will be helpful 
in further advances. 
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